A Spatial Visual Words of Discrete Image Scene for Indoor Localization
نویسندگان
چکیده
One of the fundamental problems in accurate indoor place recognition is the presence of similar scene images in different places in the environmental space of the mobile robot, such as the presence of computer or office table in many rooms. This problem causes bewilderment and confusion among different places. To overcome this, the local features of these image scenes should be represented in more discriminate and more robust way. However to perform this, the spatial relation of the local features should be considered. This study introduces a novel approach for place recognition based on correlation degree for the entropy of covariance feature vectors. In fact, these feature vectors are being extracted from the minimum distance of SIFT grid features of the image scene and optimized K entries from the codebook which is constructed by K means. The Entropy of Covariance features (ECV) issued to represent the scene image in order to remove the confusion of similar images that are related to different places. The conclusion observed from the acquired results showed that this approach has a stable manner due to its reliability in the place recognition for the robot localization and outperforms the other approaches. Finally, the proposed ECV approach gives an intelligent way for the robot localization through the correlation of entropy covariance feature vectors for the scene images.
منابع مشابه
Homography based Visual Bag of Word Model for Scene Matching in Indoor Environments
This paper proposes a data driven approach to perform scene localization in indoor environments. The proposed algorithm named p-BoW is designed to cope with self-repetitive and confusing patterns in indoor environments of any type. The algorithm uses the Visual Bag of Words (BoW) model along with proposed voting scheme to perform scene localization from a database of captured images. In the fir...
متن کاملMulti-Focus Image Fusion in DCT Domain using Variance and Energy of Laplacian and Correlation Coefficient for Visual Sensor Networks
The purpose of multi-focus image fusion is gathering the essential information and the focused parts from the input multi-focus images into a single image. These multi-focus images are captured with different depths of focus of cameras. A lot of multi-focus image fusion techniques have been introduced using considering the focus measurement in the spatial domain. However, the multi-focus image ...
متن کاملSelf-localization Using Visual Experience Across Domains
— In this study, we aim to solve the single-view robot self-localization problem by using visual experience across domains. Although the bag-of-words method constitutes a popular approach to single-view localization, it fails badly when it's visual vocabulary is learned and tested in different domains. Further, we are interested in using a cross-domain setting, in which the visual vocabulary is...
متن کاملFORNONI, CAPUTO: SALIENCY-DRIVEN POOLING FOR INDOOR SCENE RECOGNITION 1 Indoor Scene Recognition using Task and Saliency-driven Feature Pooling
Indoor scenes are characterized by a high intra-class variability, mainly due to the intrinsic variety of the objects in them, and to the drastic image variations due to (even small) view-point changes. One of the main trends in the literature has been to employ representations coupling statistical characterizations of the image, with a description of their spatial distribution. This is usually...
متن کاملNatural scene text localization using edge color signature
Localizing text regions in images taken from natural scenes is one of the challenging problems dueto variations in font, size, color and orientation of text. In this paper, we introduce a new concept socalled Edge Color Signature for localizing text regions in an image. This method is able to localizeboth Farsi and English texts. In the proposed method rst a pyramid using diff...
متن کامل